Unit of observation

Relevel the categorical variables that have no missing values to reduce the number of dummy-variable columns
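One way to read this step is reference-level dummy coding: one level of each categorical variable is chosen as the baseline and gets no column of its own, so a variable with k levels yields k - 1 dummy columns. The sketch below is only an illustration in plain Python. The helper `dummy_encode`, the `is_` column prefix, and the sample class labels are hypothetical, not taken from the original analysis.

```python
def dummy_encode(values, reference):
    """One-hot encode `values`, omitting the column for the `reference` level."""
    # Every level except the reference gets its own 0/1 column.
    levels = sorted(set(values) - {reference})
    return [{f"is_{level}": int(value == level) for level in levels}
            for value in values]


# Hypothetical sample data: three travel classes, with "Eco" as the baseline.
travel_class = ["Business", "Eco", "Eco Plus", "Eco", "Business"]
encoded = dummy_encode(travel_class, reference="Eco")

# Three levels produce two columns; a row of all zeros means the baseline level.
for label, row in zip(travel_class, encoded):
    print(label, row)
```

With pandas, `pd.get_dummies(df, drop_first=True)` gives a similar result, although it drops the first level in sorted order rather than a level you name.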

Remove outliers

Variable transformation

Age has a weak positive correlation with satisfaction. Class also has a weak correlation with check-in service and with onboard service rated poor; this could be because business class passengers have high expectations of the service they should receive, and Falcon Airline is likely meeting them. Flight distance is positively correlated with both arrival and departure delay, but this does not mean that a longer distance causes a longer delay; the airline may choose shorter or longer routes to save cost. Arrival delay in minutes has a strong positive correlation with departure delay in minutes. Seat comfort is positively correlated with baggage handling; it could be that passengers are more comfortable when all bags are placed in the overhead compartment. Gate location is strongly correlated with food and drink rated excellent by passengers. Inflight entertainment is positively correlated with inflight wifi service. Leg room, cleanliness, baggage handling, seat comfort and onboard service rated poor by passengers are only weakly correlated with one another.
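The observations above are statements about pairwise correlation coefficients. As a minimal sketch of how one such coefficient is obtained, the snippet below computes Pearson's r with the standard library only. The function name `pearson` and the delay values are made up for illustration and are not the airline data.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of products of deviations (numerator) and sums of squared deviations.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical delays in minutes for six flights.
departure_delay = [0, 5, 12, 30, 45, 60]
arrival_delay = [2, 4, 15, 28, 50, 58]

r = pearson(departure_delay, arrival_delay)
print(f"departure vs. arrival delay: r = {r:.3f}")
```

A value of r close to 1 indicates a strong positive linear relationship, as reported for the two delay variables. A large r still says nothing about cause, which is the caveat raised for flight distance and delay.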

Exploratory data analysis

Evaluating satisfied

Logistic regression

Optimal Threshold

Decision trees

Pruning

Hyperparameter tuning

Random Forest

Hyperparameter tuning

Gradient Boosting Classifier

AdaBoost

Gradient Boosting

XGBoost

Stacking Classifier

Model Comparison